Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition
نویسندگان
چکیده
This paper presents an attempt to introduce unvoiced landmarks into statistical continuous speech recognition system. The unvoiced landmark detection algorithm proposed here locates the points in speech where the vocal folds stop or begin freely vibrating. In our experiments, 87.47% of stops and 98.94% of fricatives are segmented from speech after the unvoiced landmark detection, with a very low insertion error rate of 0.13%. Then these landmarks are incorporated into decoding process of segment model based recognizer as search beginning indicators. The effectiveness of landmark detection algorithm is verified in our landmark-guided recognition system with 240 sentences in 863Test database.
منابع مشابه
Landmark-Guided Segmental Speech Decoding for Continuous Mandarin Speech Recognition
In this paper, we propose a framework that attempts to incorporate landmarks into a segment-based Mandarin speech recognition system. In this method, landmarks provide boundary information and phonetic class information, and the information is used to direct the decoding process. To prove the validity of this method, two kinds of landmarks that can be reliably detected are used to direct the de...
متن کاملA novel path extension framework using steady segment detection for Mandarin speech recognition
Frame based decoders are short of using long span of time knowledge while segment based decoders often confuse with complex calculating. This paper proposes a novel decoding framework by integrating steady speech segments information into path extension procedure. Firstly, as baseline decoding system, a dynamic lexicon-tree copy recognizer is developed, which aims to accelerate popular frame ba...
متن کاملMandarin Chinese tone nucleus detection with landmarks
This paper discusses a new approach to improve tone recognition by modeling the tone nucleus with vowel landmark detection. The tone nucleus region is identified based on vowel landmark frames derived by an automatic landmark recognition system. In the corresponding tone recognition experiments, the best results with landmark-based tone nucleus regions outperform the best baseline system result...
متن کاملRobust F0 modeling for Mandarin speech recognition in noise
The F0 contour plays an important role in recognizing spoken tonal languages like Mandarin Chinese. However, the discontinuity of F0 between voiced and unvoiced transition has traditionally been a bottleneck in creating a succinct statistical tone model for automatic speech recognition applications. By applying successfully the Multi-Space Distribution (MSD) to tone modeling, we recently report...
متن کاملGeneration of Fundamental Frequency Contours of Mandarin in HMM-based Speech Synthesis using Generation Process Model
The HMM-based speech synthesis system can produce high quality synthetic speech with flexible modeling of spectral and prosodic parameters. In this approach, short term spectra, fundamental frequency (F0) and duration are generated by multi-stream HMMs separately. However the quality of synthetic speech degrades when feature vectors used in training are noisy. Among all noisy features, pitch tr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006